AD 


Award Number: W81XWH-11-1-0488 


TITLE: "Development of Advanced Technologies for Complete Genomic and Proteomic 
Characterization of Quantized Human Tumor Cells" 


PRINCIPAL INVESTIGATOR: Dr. Charles Cobbs 


CONTRACTING ORGANIZATION: Swedish Health Services 


Seattle, WA 98122-4379 


REPORT DATE: July 2014 


TYPE OF REPORT: Annual 


PREPARED FOR: U.S. Army Medical Research and Materiel Command 
Fort Detrick, Maryland 21702-5012 


DISTRIBUTION STATEMENT: Approved for Public Release; 
Distribution Unlimited 


The views, opinions and/or findings contained in this report are those of the author(s) and 
should not be construed as an official Department of the Army position, policy or decision 
unless so designated by other documentation. 


REPORT DOCUMENTATION PAGE Bien eae 


Public reporting burden for this collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the 
data needed, and completing and reviewing this collection of information. Send comments regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing 
this burden to Department of Defense, Washington Headquarters Services, Directorate for Information Operations and Reports (0704-0188), 1215 Jefferson Davis Highway, Suite 1204, Arlington, VA 22202- 
4302. Respondents should be aware that notwithstanding any other provision of law, no person shall be subject to any penalty for failing to comply with a collection of information if it does not display a currently 
valid OMB control number. PLEASE DO NOT RETURN YOUR FORM TO THE ABOVE ADDRESS. 


1. REPORT DATE 2. REPORT TYPE 3. DATES COVERED 
July 2014 Annual 15 Jun 2013 — 14 Jun 2014 


4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER 


"Development of Advanced Technologies for Complete Genomic and Proteomic 


Characterization of Quantized Human Tumor Cells" 5b. GRANT NUMBER 
W81XWH-11-1-0488 


5c. PROGRAM ELEMENT NUMBER 
6. AUTHOR(S) 5d. PROJECT NUMBER 
Dr. Charles Cobbs 5e. TASK NUMBER 
Email: charles.cobbs@swedish.org 5f. WORK UNIT NUMBER 


7. PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES) 8. PERFORMING ORGANIZATION REPORT 
NUMBER 


Swedish Health Services 
Seattle, WA 98122-4379 


9. SPONSORING / MONITORING AGENCY NAME(S) AND ADDRESS(ES) 10. SPONSOR/MONITOR’S ACRONYM(S) 
U.S. Army Medical Research and Materiel Command 
Fort Detrick, Maryland 21702-5012 
11. SPONSOR/MONITOR’S REPORT 
NUMBER(S) 


12. DISTRIBUTION / AVAILABILITY STATEMENT 
Approved for Public Release; Distribution Unlimited 


13. SUPPLEMENTARY NOTES 


14. ABSTRACT With the establishment of Glioblastoma (GBM) cell lines from GBM patient’s tumor samples and quantized cell populations of each 
of the parental GBM cell lines, we have completed most of our major aims of this project. We will continue in our efforts in the development and 
analyses of these quantized cells and develop the genomic and proteomic technology to interpret specific transcriptome and proteome signatures. 
Whole genome sequencing from two families of GBM patients are now well established and from the basis of the molecular characterization of the 
tumor development and signatures presented by these tumors for the development of diagnostic signature panels. We have established efficient 
secretome cell culture conditions to enable proteomic analysis of these quantized cell sub populations and have begun to assemble the protein 
signatures of GBM tumors underpinned by the comprehensive molecular characterization of these tumors through family WGS, transcriptomics of 
single cells and secreted protein analysis. This program will ultimately provide new technical capabilities through broad based molecular 
characterization at the genomic and proteomic analysis of GBM patients and their immediate families and has already provide significant results. 
This final effort in the molecular characterizatio patient and their family and the establishment and deployment of a proteomic 


15. SUBJECT T 
Human cohorts, Glioblastoma, Genomic, Proteomic, Single-cell technologies, Hypothesis-driven, integrative systems 
approach, Early diagnosis,Patient stratification,Blood protein biomarkers,Quantized cell populations 


16. SECURITY CLASSIFICATION OF: 17. LIMITATION 18. NUMBER 19a. NAME OF RESPONSIBLE PERSON 
OF ABSTRACT OF PAGES USAMRMC 
a. REPORT b. ABSTRACT c. THIS PAGE 19b. TELEPHONE NUMBER (include area 
U U UU 20 code) 


Introduction 


Body 


Key Research Accomplishments 


Reportable Outcomes 


Conclusions 


References 


Appendices 


Table of Contents 


Page 


18 


19 


19 


20 


N/A 


INTRODUCTION 

This collaborative research program will provide important insights into human cancer 
mechanisms. In particular, we have developed quantitative tools with direct applications for 
patients with glioblastoma multiforme (GBM), the most common and most aggressive brain 
tumor in adults. Despite treatment strategies that include gross total resection, post-operative 
radiation and chemotherapy, most patients develop tumor recurrence within months. Cancer 
stem cells have been implicated as the presumed cause of tumor recurrence and resistance to 
therapy. With this in mind, we will utilize GBM patient-derived cell lines and new technologies for 
transcriptome, miRNAome, proteome, and single-cell analyses to study quantized GBM cell 
populations and their role in disease progression. This proposal will significantly advance 
genomic, proteomic and single-cell technologies. The tools proposed here will be generally 
applicable to all cancer-based studies, as the nature of the tool development is designed to 
identify and quantify DNA, RNAs, proteins and cells, challenges ubiquitous to all human disease 
systems. 


The expected outcomes of this research program include: 1) a deeper understanding of human 
GBM disease mechanisms; 2) blood protein biomarkers for use in early diagnosis, assessment 
of GBM progression, evaluation of drug treatment effectiveness, and early detection of disease 
recurrence; 3) new strategies for advanced genomic sequencing of quantized cancer cells and 
their normal counterparts to identify cancer-driver mutations; 4) new technologies for 
transcriptome, miRNAome, proteome, and single-cell analyses, and 5) the creation of quantized 
GBM cell lines that can be used for general molecular characterization as well as to assess 
GBM biology and the effectiveness of existing drugs in reacting with these cell types. To 
achieve these goals we have pursued the following aims: 


Specific Aim 1. Isolate up to 1000 cells from each of five human GBMs and quantify initially 
500 different transcripts from each cell (transcription factors, CD molecules, relevant signal 
transduction pathways, etc). Determine whether computational analyses can classify these cells 
into discrete quantized cell types. 


Specific Aim 2. Sort the disassociated tumor cells from GBM into their quantized cell 
populations using cell sorting/CD antibodies to each quantized cell type for functional analyses 
and establish primary cell lines. These will be used for molecular analyses at the genome, 
transcriptome, miRNAome and selected proteome levels. 


Specific Aim 3. Assess 20-40 candidate blood biomarkers in the bloods of 100 GBM patients 
with regard to their ability to stratify disease, assess disease progression and predict at an early 
stage GBM recurrence. Eventually we will use these biomarkers to assess the effectiveness of 
therapy. 


Specific Aim 4. Ten to 20 cells from each major quantized GBM cell type from two patients will 
be used to determine the complete genome sequences. We will also determine the normal 
genome sequences of each patient and their family members to enable the Mendelian-based 
error correction process. The mutations will be analyzed against quantitative changes in the 
transcriptomes, miRNAomes and proteomes and against the relevant biological networks. 


Specific Aim 5. Analyze the quantized cell populations for their responses (transcriptome, 
miRNAome, etc) to the perturbations of key GBM-relevant molecules (e.g. nodal points in 
networks) by RNAi perturbations as well as their responses to GBM-relevant drugs and natural 
ligands. 


BODY 
Specific Aims 
1, 2 & 4. During 


the past year, 
we continued to 
collect GBM 
tumor samples 
to establish the | Figure 1. Immunostaining for differentiation markers: (A) GFAP/astrocytes, (B), TUJ- 
patient cohort | 1/neurons and (C) O4/oligodendrocytes. Parental GBM cultures established at SNI are 
available for multipotent and able to differentiate into astrocytes, neurons and oligodendrocytes. 
molecular analysis, genome sequencing, and quantitative assays. The lvy Center for Advanced 
Brain Tumor Treatment at the Swedish Neuroscience Institute (SNI) has collected tumor tissue 
eligible for this program from over forty GBM patients to date. Several tissue specimens were 
processed to establish primary GBM cell lines and tested to confirm the presence of the stem 
cell phenotype. In brief, tumor samples were treated with 
Accutase (Sigma) immediately after surgical resection, 
and single cell suspensions were plated in neurobasal 
media with epidermal growth factor (EGF) and fibroblast 
growth factor (FGF-2) as described (7, 2). Although the 
criteria that define cancer stem cells are controversial (3- 
6), it is generally accepted that GBM stem cells must 
demonstrate self-renewal, multipotency, and  tumor- 
initiating ability in vivo (7). Our patient derived cultures 
contain cells capable of forming self-renewing spheres 
(signifying the presence of a self-renewing stem cell 
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Figure 2. Average tumor volume (n = 5) in 
immunocompromised mice six weeks after 
implantation of GBM-patient derived cells | population), are multipotent (Figure 1), and have the 


Ss inset: Tumor mass removed fom | ability to form tumors in NOD/SCID mice 6-8 weeks after 
ost. Tumor initiation in vivo confirms the 


presence of stem cell populations within the | cell inoculation (Figure 2). These results demonstrate the 
ll culture. : ae . : 
dic ieee le presence of stem-like cells within our patient-derived GBM 


cultures, and verify that these cell lines are suitable for the proposed research. 
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rae | wee [ cel GBM (Gliosarcoma), grade IV | Left Temporal | Mesenchymal | Unmethylated Not available Not available 


140mg TMZ, over 11 weeks (concurrent with |IMRT, 4500 cGy in 25 fractions, 
186 | Male 76 GBM, grade IV Right Temporal | _Proneuronal Unmethylated_|radiation). over 6 weeks. 


TMZ, concurrent, 6 weeks; MRT, 4140 cGy in 23 fractions, 


TMZ, maintenance 5x/mo, 38 weeks; |concurrent, 3 weeks. IMRT, 
TMZ, maintenance 21x/mo, 8 weeks; |1800 cGy in 10 fractions, boost, 
243 | Male 57 GBM, grade Right Frontal Proliferative Methylated TMZ, maintenance 5x/mo, 8 weeks. _[3 weeks. 


MRT, over 8 weeks 
150mg TMZ, every 2 weeks 5 days cycle, for |Stereotactic, 2500 cGy in 5 
291 | Female| 63 GBM, grade IV Right Parietal_| Mesenchymal Methylated [54 weeks. factions, 1 week. 


MRT, 5940 cGy in 33 fractions, 
348 | Female | 49 GBM, grade IV Right Frontal_| Not determined| Unmethylated |105mg TMZ, concurrent, 7 weeks. 7 weeks. 


Table 1. Clinical diagnosis, treatment history and survival of GBM patients used in this study. 


We have transferred several of these GBM-derived cultures (as summarized in Table 1) to our 
collaborators at the Institute for Systems Biology (ISB; Award Number W81XWH-1 1-1-0487, Dr. 
Robert Moritz) for the generation of quantized cell populations. These cell populations serve as 
the foundation for subsequent genomic, transcriptomic, and proteomic studies. In particular, the 
GBM specimens SN243 and SN291, for which we have consenting family members, were 
selected for complete molecular analyses. We collected blood (processed as plasma and 
peripheral blood mononuclear cells [PBMCs]), from both SN243 and SN291 patients, and their 
family members during the last year (as summarized in Table 2). This completes the specimen 
cohort required for moleucular analyses at the genome, transcriptome, miRNAome and 
proteome levels (Specific Aims 1 and 2). 


A number of 
quantized cell 
SN243-P1 Parent Male populations have 
SN243 SN243-P2 Parent Female 
Male, 56 SN243-C1 Child Female 
SN291-S1 Sibling Female ; 
SN291 SN291-C1 Child Male the corresponding 
Female, 62 SN291-C2 Child Female SN243 and SN291 
SN348-P1 Parent Male parental cell lines 
SN348 SN348-P2 Parent Female (Figure 3). To 
Female, 48 SN348-S1 Sibling Female 
SN348-C1 Child Female 


Patient # Family # Relationship Gender 


been successfully 
established — from 


generate quantized 
cell populations a 
single cell clonal 
culture technique, integrated with single cell sorting using the BD FACS Aria II, was developed. 
Approximately 60% of the sorted cells formed colonies (>100 cells) and were collected and 
frozen for further analysis. For each primary tumor line, we established clonal cultures which 
exhibited distinct morphological phenotypes. Given that each clone presumably carries a 
uniform genome, it is suitable for whole genome sequencing. 5 clones have been selected from 
each patient for subsequent ‘omic analysis. 


Table 2. Blood samples collected for whole genome family sequencing 
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Figure 3. Establishment of single cell clonal cultures from GBM patient SN291. A total of 12 clonal 
cultures have been generated from this patient sample. In brief, cells were cultured on plates coated 
with laminin and grown under serum-free conditions with stem cell media supplement with B27, N2 and 
the growth factors EGF and bFGF. Single cell sorting was performed using the BD FACS Aria Il. 
Approximately 60% of the sorted cells formed colonies (>100 cells) and were used for further analysis. 


Sample preparation for whole genome sequencing 


Genomic DNA has been extracted from 
both SN243 and SN291 specimens for 
whole genome sequencing. For SN291, 
the specimens include tumor tissue, 
normal sequence coverage of primary 
tumor cell culture, 5 quantized cell 
populations, patient-derived PBMCs and 
PBMCs from the patient’s two children. 
The samples for SN243 include tumor 
tissue, primary tumor cell culture, 5 
subclones, patient-derived PBMCs, and 
PBMCs from both the patient’s parents 
and one child (Figure 4). The quality of 
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Figure 4. Patient selection and family history of 
samples submitted for whole genome sequencing. 


each DNA prep was validated prior to submission to Complete Genomics (Mountain View, CA) 


for whole genome sequencing analysis. 


A total of 22 DNA samples were submitted to Complete Genomics. All ten DNA samples from 
patient SN291 and family members passed quality control by CGI. High quality whole genome 
sequencing data was generated and sent back to our collaborators at ISB for further analysis 
(Figure 5). As shown in Table 2, eleven out of the twelve SN243 samples passed initial CGI 
quality control, and were moved down their pipeline for sequencing analysis. However, we were 
recently informed that DNA samples from two clonal populations (4 and 12) unfortunately failed 
insertion of one of the adapters. SN243 clone 4 and clone 12 are therefore being expanded 
again. Once sufficient cells are available, genomic DNA will be extracted from SN243 clone 4 
and clone 12 for resubmission to CGI. 


Excised GBM 


SN291_Tissue 
SN291_PBMC (2717_G01) 
(2717_F01] GS02717-DNA_F01_G01_H01_250_37-ASM-T1 


PS1_GS000035756-ASM-N1 
GS02717-DNA_F01_G01_H01_250_37-ASM-T2 


SN291_cell_line 
[| O (2717_H01) 


SN291_Child1 SN291_Child2 
2717_A02—2717_802 Ss SS Sa iS 


GS000035705 GS000035715 SN291_clone2 SN291_clone3 SN291_clone4 SN291_clone5 $N291_clone10 
[2717_A01] [2717_B01] (2717_C01] [2717_D01] [2717_E01] 


GS000035642-ASM-T1 GS000035642-ASM-T2 GS000035677-ASM-T2 


GS000035642-ASM-T3 GS000035677-ASM-T1 


Figure 5. Overview of patient SN291 genome dataset. For each sample, the descriptive identifier, the 
vendor sample identifier between square brackets, and the vendor assembly identifier is shown. 
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Table 3. CGI quality control template for 12 genomic DNA samples from SN243 family. 


Analysis of Whole genome data: Karyotype computed from genome data 

Our collaborators at ISB have developed a sophisticated method for the identification of 
aneuploidies at high resolution. This has been applied in the analysis of the SN291 genomes 
(see PI Moritz report). Results are presented in Figure 6 below. 
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Figure 6. Computed karyotype. For each chromosome, we depict (bottom to top) the computed copy 
numbers observed for SN291’s PBMC genome, cancer tissue, cell line and subclones. Red denotes 
deletions (haploid), blue represents expansions (triploid, with light blue representing tetraploid or 
higher). 


The complete loss of one copy of chr10 was observed in the SN291 cell line and corresponding 
subclones. The fragmented signal for the cancer tissue indicates the presence of this 
aneuploidy in a subpopulation of the mixed tissue. The chromosomal deletion is not observed in 
the PBMC sample, as expected. Similarly, we observed large-scale but partial losses in 
chromosomes 1, 9, 12 and 13. Conversely, we observed an extra copy (triploidy) of chr7 and 
most of chr19. 


Relationship between subclones of SN291 

Based on the aneuploidy analysis, it is evident that the five subclones are independent of each 
other (Figure 7). Each subclone presents a small number of minor private aneuploidies, none of 
which is shared by two or more subclones. 


Variant analysis 


Figure 7. Relationships between the SN291 
subclones and the parental cell line. Each 
subclone presents a very small number of 
private aneuploidies (deletions or expansions). 


Ingenuity Variant Analysis is a web-based 
application that helps researchers study human 
disease by identifying causal variants from 
human sequencing data. Ingenuity Variant 
Analysis was applied to the ten genome 
sequences to identify candidate variants 
associated with the glioblastoma phenotype, 
using the tissue, cell line and five subclones as 
“cases”. We required candidate variants to be 
predicted deleterious, observed in at least three 
“cases” with quality >= 35, and with population 
frequency under 1%. We used Ingenuity’s 
knowledgebase to select cancer driver variants 
directly affecting genes known to be involved in 
GBM. As shown in Figure 8, a number of 
interesting gene mutation candidates were 
identified. Of particular interest is a stop-gain 
SNV_ in the RAD51B gene, present in 
heterozygous form in the genome of the patient 


(PBMC), the cancer tissue, the cell line and all the subclones. This variant is very rare, with a 
population frequency of 0.0079% (as computed using Kaviar genome database) and is 
confirmed by its presence in the daughter (but absent in the son). A second variant of interest is 
a novel missense SNV in DVL2, predicted to be deleterious. 
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Figure 8. Identification of detrimental variants. Orange and blue denote gain of function and loss of 
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RNA-Seq Data Analysis 

A number of published computational algorithms have been evaluated for analyzing RNA-seq 
data (Figure 9). To fully extract gene expression information from glioblastoma tumor tissues 
and clonal populations for biomarker discovery, we have decided to use TopHat based on its 
more general application, robust performance, and wide acceptance in the community, in 
addition to the fact that our single cell RNA-seq data is of lower coverage. 


Unspliced: Bowtie, BWA, BFAST, Mag, ... 


RUM, 2011, (93) 
RNASEQR, 2012, (11) 


— Annotation-guided = OSA, 2012, (17) 
Alignment SAMMate, 2011, (18) 


MapAl, 2012, (1) 


ContextMap, 2012, (7) 
MapSplice, 2010, (196) 
De novo ABMapper, 2011, (17) 
CRAC, 2013, (6) 
HMMSplicer, 2010, (49) 


Spliced — 


GEM, 2013, (0) 
STAR, 2013, (91) 

_ Both TopHat, 2009, (2121) 
SpliceMap, 2010, (182) 
TopHat2, 2013, (165) 


Figure 9. Evaluation of published RNA-seq data analysis algorithms. 
Numbers in parentheses indicate number of citations for each program. 


Single Cell RNA-Seq data for cultured SN291 tumor cells. 

We have successfully generated single cell RNA-Seq data from SN291 tumor cells. cDNA 
libraries from 96 individual single cells, derived from patient SN291, were prepared, indexed and 
subjected to next generation sequencing on an Illumina HiSeq platform. Approximately 1-2 
million quality reads from 87 cells were generated; ~60% of reads can be mapped concordantly 
to the human genome. For nine cells insufficient sequencing reads were produced and thus 
these were excluded from further analysis (Table 4). 


Principle component analysis and network mapping. 

Principle component analysis was performed on the 87 single cell transcriptomes for SN291. As 
shown in Figure 10, several distinct cell clusters were identified. Our collaborators at ISB used 
their previously published work (PNAS, 2011) to evaluate the enrichment pattern for CD133+ 
gene signatures. SN291 cells bearing CD133+ signature (red) show distinct separation from 
those cells negative for the signature. One cell (purple) shows a strong enrichment for Wnt 
signaling pathway genes. 
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249 | 1,326,584 | 66.1% | 19.7% | 
| go2 | 1,191,278 | 47.5% | 36.3% | 
| e03 | 1,612,721] 62.1% | 20.9% | 
| p06 | 1,797,089] 60.4% | 22.1% | 
| gos | 1,351,292] 55.6% | 73.4% | 
| f09 | 1,605,567] 616% | 215% | 1 | 
jell | 1,176,268] 49.4% | 34.2% 59] 1.476.244 | 64.9% | 

| g60 | 1,330,170] 65.6% 

Pasi [1,706,541 | 64.0% | 22.5% | 1 | 
|g62_| 1,682,185 | 64.9% | 20.4% | 
Pasa [1,291,539 66.3% | 16.3% | 1 | 
jg66 | 546,529 | 50.9% | 33.4% | 
|. g69 | 1,306,388 | 55.0% | 28.9% | 
[ass [aroseo [ersx [zasn ta esi [1,304,525 | 59.4% | 25.0% | 1 | 
jaf e82_ [1,708,550] 62.9% | 247% | 
| e83 | 1,493,367] 68.9% | i8i% | 2 | 
| gs4 | 508605 | 56.4% | 76.8% | 1 
ese | 731,840 | 47.7% | 35.7% | 
1e89 | 1,145,143 | 47.6% | 35.0% 1 
| e90 | 1,706,828 | 65.2% | 18.0% | 1 
pe44 [1,175,265 | 55.7% | 27.6% | 


Table 4. RNA-seq analysis of 96 single cells from patient SN291. 


Refinement of RNA samples from clonal populations for RNA-seq analysis 

One of the key components of this study will be to compare gene expression profiles to the 
whole genome sequencing data generated from the same clonal populations. In order to 
achieve this goal, total RNAs have been extracted from the parental SN243 and SN291 cell 
lines and five to six of their subclones. RNA integrity has been validated and sample prep for the 
RNA-seq will be completed in the coming months prior to sample submission. 
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Figure 10. Principle component analysis of RNA-seq data generated from 
individual cells from the SN291 parental culture. Each dot represents a 
single cell. Color gradient indicates enrichment score for either our 
published CD133+ gene signature (PNAS, 2011) or Wnt pathway genes. 


Specific Aim 3. Parental tumor cell lines and subclones have been expanded for proteomic 
analysis aimed at identifying candidate protein biomarkers for glioblastoma. Extraneous proteins 
were removed from culture conditions to allow for the identification of proteins secreted directly 
by quantized cell populations. Over the last year, cell pellets and secreted protein preparations 
have been collected for the SN291 parental line and five subclones. Similarly, cell pellets and 
secreted protein preparations have been prepared for the SN243 parental line and six 
subclones. SN260 and SN348 parental cell lines have also been prepped for comparative 
proteomic analysis. In summary, cell pellets and secreted protein samples were prepared from 
four parental cell lines and eleven subclones derived (Figure 11). 


We have begun deep proteome analysis of the secretomes by performing off-gel fractionation of 
the peptide digests of each of the secretome preparations and have combined the analysis of 
each of the fractions to prepare lists of differentially secreted proteins from each of the cell lines 
(progress to date shown in Table 5). The cell pellets have been further processed by our 
collaborators at ISB to study the cell surface proteome using N-glyco capture technology and 
subsequent analysis by liquid chromatography-mass spectrometry (LC-MS/MS). The N- 
glycosylated proteins are of interest in the context of biomarker identification strategies and 
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expected to provide insight in differentially expressed signatures of glioblastoma patients. The 
secretome preparations of the four parental cell lines (GN243, SN291, SN260 and SN348) and 
eleven subclones yielded enough material for subsequent LC-MS/MS analysis. The 15 samples 
were subjected to tryptic digestion using a standard protocol including the reduction of disulfide 
bonds with dithiothreitol and alkylation of the sample with iodoacetamide. Samples were 
analyzed in duplicate using a high resolution QExactive mass spectrometer (Thermo Fisher 
Scientific) allowing peptide fragmentation by higher-energy collisional dissociation. Peptides 
were separated on a reversed phase column using a particular long gradient of 4h and nano-LC 
conditions to allow highly sensitive in depth analysis of the secretome of each tumor cell 
sample. 


primary and quantized cells derived from tumor 
SN243 SN260 SN291 SN348 


parental parental parental parental 3 
subclone# ‘1 subclone # 2 (media) 
subclone# 2 subclone # 3 & 

subclone# 4 subclone # 4 cell surface proteome 
subclone# 6 subclone # 5 (pellet) 
subclone# 7 subclone #10 

subclone #12 


discovery proteomics 


secretome preparation 


> Selection of differentially expressed markers + markers from WGS 


blood samples from patients and family members 
parent 1 patient parent 1 targeted proteomics 
parent 2 sibling 1 parent 2 
patient child 1 patient 
child 1 child 2 sibling 1 
child 1 


blood samples from other GBM patients SRM 


patient 
SN327 


shs2o -UULULLL 


> Validation of markers by selected reaction monitoring 


Figure 11. Samples prepared for proteomic analysis to identify candidate protein biomarkers for 
glioblastoma. 


Proteomic data are analyzed through sequence database searching using the software tool 
suite of the Trans-Proteomic-Pipeline (developed at ISB) for the correct assignment of MS 
spectra to peptides and to infer the proteins from these peptide identifications. A standard 
database would allow detection of known proteins but not the detection of mutational changes 
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from the tumor genome or the tumor derived quantized cells. To include such mutations, an 
extended cancer genome specific database will be generated that considers the results from the 
whole genome sequence and allows for a correlation of specific mutations arising from these 
tumor cells on the proteome level. This will be done for all SN291 samples (for which full 
genome sequence information is available) and subsequently for SN243 once whole genome 
sequencing results are received from Complete Genomics. 


Proteomics - Secretome Analysis of Parental Cell Lines and Subclones 
I LC-MS/MS 
ee = digestion MN data analysis comment 
preparation analysis 
SN291 parental v v in progress 


media blank 1 v in progress 


cell line type cell culture 


SN291_noS parental in progress supplementals further removed 


SN291 subclone 2 in progress 
SN291 subclone 3 in progress 
SN291 subclone 4 in progress 
SN291 subclone 5 in progress very slow growing 
SN291 subclone 10 in progress 


SN243 parental in progress 
media blank 2 in progress 


SN243_noS parental 
media blank 3 


supplementals further removed 


AN 


SN243 subclone 1 


(green = subclone 2 
subclones 


in progress 
in progress 
subclone 3 
subclone 4 


selected for 


proteomics) In progress 


subclone 5 
subclone 6 in progress 
subclone 7 


subclone 8 


in progress 


subclone 9 
subclone 10 


QLNNNNNNNNNS 


subclone 12 in progress 


S 


SN260 parental v v v in progress 


SN348 parental v v v v in progress 


Table 5. Progress on proteomic secretome analysis of parental cell lines and subclones. 


Biomarker candidates derived from this discovery proteomic analysis will be correlated with the 
data derived from the transcriptome and whole genome analysis to define the final list of 
candidates that will be subjected to targeted quantitative proteomic selected-reaction monitoring 
(SRM) analysis. Blood samples have been collected from patient SN243 and SN291, and of 
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three direct family members from each patient. We have also collected additional blood plasma 
samples from patient SN348 and four direct family members as well as other patients (not 
selected for whole genome sequencing) to expand the patient population available for targeted 
SRM analysis of the selected candidates. This will enable us to evaluate GBM specific tumor 
markers in this cohort of patients as well as the larger pool of 100 plasma samples from both 
GBM patients and normal subjects. 


Database construction for cancer derived mutational proteome analysis. 

Our collaborators at ISB have begun to construct a database for cancer derived mutational 
proteome analysis using the existing whole genome sequencing data from SN291. This will be 
expanded to include SN243 once the data is available (see PI Moritz report). 


Specific Aim 5. We have developed a 


drug screening protocol for primary acive agents 


Class* Total in class 


GBM cell lines (7). In our previous SN143_ | SN186 


study, SN143 and SN186 parental cells 
lines were tested against a library of 
2,000 compounds composed primarily 
of FDA approved compounds (50%), 
natural products (30%), and other 
bioactive components (20%) 
(MicroSource Spectrum — collection, 
MicroSource Discovery Inc.), to identify 
inhibitors of GBM stem cell (GSC) 


Alcohol antagonist 
Antihelminthic 
Antiarrhythmic 

Antibacterial 
Antifungal 
Antineoplastic 
Antihyperlipidemic 


Antihypertensive 


{ 
7 


proliferation. As shown in Table 6, this 


Anti-infective 
drug screening assay was able to 
identify approximately 100 compounds el cl 
that were cytotoxic against SN143 and Cardiotonic 
SN186. The compounds _ identified Diuretic 
represent multiple classes of drugs and H1 antihistamine 


natural products, including 
antineoplastics, cardiotonics, 
antihelminthics, and others. For Aim 5, 
our goal is to apply this methodology to 
quantized cells and use this procedure 
to identify therapeutic agents with the | Undetermined activity A44 19 

potential to target the stem cell Total agents 1101b 92 105 
population. In particular, we will 
compare the drug responsiveness of 
the parental cultures with the 
corresponding sub clone populations. We hope this will provide valuable information that can be 


Immunosuppressant 
Psychotropic 
Sclerosing agent 


Vasodilator 35 


Table 6. Pharmacological classes for inhibitors of GSC 
proliferation of SN143 and SN186 (7). 
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translated to the clinic and used to design effective treatment strategies for future GBM patients. 
Given that SN243 and SN291 samples have been used for whole genome sequencing, we will 
apply a similar drug screening approach to both these primary cell lines and their corresponding 
subclones. Taking into consideration the number of cells required for drug screening and the 
growth kinetics of the quantized cells, we have decided to pursue a 160-oncology focused 
library. This library is composed of FDA approved antineoplastics as well as compounds in late 
phase clinical trials, several of which include glioblastoma-relevant drugs such as PI3K/ mTOR 
inhibitors, VEGFR inhibitors and met-inhibitors. 


To date, we have optimized drug screening assays with the 160-compound library using the 
SN186 parental cell line as shown in Figure 12. Drug potency is typically assessed by 
determining the half maximal inhibitory concentration (ICs) (/.e. the concentration of a drug that 
is required for 50% inhibition in vitro). With this in mind, we generated 8-point dose response 
curves to access the ICs, of each drug (Figure 12B). This same technique will be applied to 
SN243 and SN291 parental cell lines, and their corresponding quantized cell populations in the 
final year. 


A Cells into 384-well plates (1000/well) 


Cells attach overnight (37°C, 5% COz) 


Add different compounds (0.01-10 uM) 


Incubate for 96 hrs (37°C, 5% COs) 


Add Cell Titer-Glo for cell lysis (15 mins) 


© 
Oo 


% cell viability 
[op) 
oO 


Record luminescent signal on Envision 


10° 107 10° ear 


Signal = ATP = metabolically active cells 
[Compound] (M) 
Figure 12. (A) Established protocol for high throughput chemical screens. Viable cells 
(metabolically active) are determined by ATP quantification, using the CellTiter-Glo 
luminescent cell viability assay (Promega). (B) Example of dose response curve used to 
determine IC50 value for each drug candidate. 


We have encountered a significant logistical problem with the explosion of whole genome 
sequencing requests and capacity by our contractor Complete Genomics Inc. We have 
addressed these issues with them and are progressing towards the completion of this project. 
With large changes in staff at CGI, it has taken time to reestablish an error free system and we 
have been subjected to these delays. Other technical problems we encountered were solved 
and we do not anticipate problems with the proposed work schedule for the next 12 months. Our 


Vd: 


efforts in whole genome sequencing have advanced since or request for a no-cost extension of 
the proposal and bode well for the upcoming work to be completed. This program will be 
completed as stated and has benefited from further significant technical developments we have 
made and will provide significant results over the next 12 months. 


KEY RESEARCH ACCOMPLISHMENTS FOR 2013-2014 


We have established five GBM primary cell lines suitable for extensive molecular analysis 
(as listed in Table 1). For three of these patient samples (SN243, SN291 and SN348), we 
were able to get full family consent and have therefore collected blood from the patient’s 
family members in addition to the blood and tumor specimens collected from the GBM 
patients (Table 2). These samples are suitable for both cellular analysis, genomic analysis 
and proteomic analysis. 


Quantized cell populations have been established from SN243 and SN291 parental tumor 
cell lines. 


Whole genome sequencing has been completed for patient SN291, to include tumor, 
primary cell line, five separate quantized cells and SN291 family members. 


Key mutations in the cancer genome of SN291 have been identified. 


Transcriptomic profiling of single cells from SN291 quantized populations has been 
performed. Distinct expression patterns of selected genes have been identified. 


Cell culture conditions for secretome analysis of quantized cells and protein extraction 
conditions to maximize the amount of protein for high-mass accuracy quantitative mass 
spectrometry have been developed. 


Deep proteome secretome analysis of individual GBM primary cells and their quantized 
subclones has been performed. 


Initial proteome identifications from GBM tumor samples have been made. Cancer proteome 
specific database strategies to identify protein mutations predicted by whole genome 


sequencing have been established. 


A drug screening protocol has been developed for screening SN243 and SN291 parental 
lines and quantized cell populations. 
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REPORTABLE OUTCOMES 
Publication describing technical developments applied to GBM cell analysis: 


Sangar V, Funk CC, Kusebauch U, Campbell DS, Moritz RL, Price ND. Quantitative proteomic 
analysis reveals effects of EGFR on invasion-promoting proteins secreted by glioblastoma cells. 
Mol Cell Proteomics. 2014 Jul 5. pii: mcp.M114.040428. PMID: 24997998. 


CONCLUSION 


Description of work to be performed over the next 12 months. 

We will complete our whole genome sequence collection for patient and family SN243. This will 
complete our data collection for whole genome sequencing and provide a second extensive 
dataset to supplement the efforts we have made significant inroads on. We have established a 
data analysis pipeline wit patient SN291 and their family members and will complete this 
analysis. 


We will complete our full transcript analysis of single quantized cells from both SN291 and 
SN243 to establish expressed gene and mutations identified in the whole genome sequence 
analysis of these sample types. 


We will complete our proteomics analysis of the full sample set of expanded GBM patients and 
of the family members from two of these undergoing full genomic analysis. We will perform 
tumor proteome specific analysis of GBM primary and quantized cell lines to provide 
quantitative differences between these as well as detect and quantify expressed protein 
mutations identified from the multi-omic approach. 


We will compile a candidate biomarker list of select targets derived from WGS, single cell 
RNA-seq and cancer proteome data of both whole cells as well as quantize cell secretomes to 
build targeted assays for candidate biomarker evaluation. We will construct a panel of SRM 
assays and analysis conditions to deploy across GBM patient and normal subject plasma 
samples to conduct a biomarker evaluation for early detection of GBM tumors in both a 
unblended and blinded analysis. 


We will complete our targeted drug screening profile of GBM patient primary and quantized cells 


to reveal inhibitors of GBM stem cell proliferation and compile a list of drugs that have the 
potential to be tested in Phase-ll clinical trials. 
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